In search of the unknown user: indexing, hypertext and the world wide web
Authors
Abstract
For the purposes of this article, the indexing of information is interpreted as the pre-processing of information in order to enable its retrieval. This definition thus spans a dimension extending from classification-based approaches (pre-co-ordinate) to keyword searching (post-co-ordinate). In the first section we clarify our use of terminology, by briefly describing a framework for modelling IR systems in terms of sets of objects, relationships and functions. In the following three sections, we discuss the application of indexing functions to document collections of three specific types: (1) ‘conventional’ text databases; (2) hypertext databases; and (3) the World Wide Web, globally distributed across the Internet.
Similar resources
A Technique for Improving Web Mining using Enhanced Genetic Algorithm
The World Wide Web is growing at a very fast pace and makes a lot of information available to the public. Search engines used conventional methods to retrieve information on the Web; however, the search results of these engines can still be refined and their accuracy is not high enough. One of the methods for web mining is evolutionary algorithms, which search according to the user interests...
The Hidden Web
World Wide Web by browsing hypertext documents has led to the development and deployment of various search engines and indexing techniques. However, many information-gathering tasks are better handled by finding a referral to a human expert rather than by simply interacting with online information sources. A personal referral allows a user to judge the quality of the information he or she is re...
Structural Abstractions of Hypertext Documents for Web-Based Retrieval
There have been conflicting views in the literature on the capability of tools and mechanisms for storing and accessing information on the Internet. On one hand it has been criticized for a long time that the World Wide Web offers a chaotic environment for Web agents to extract information because the description of a document by HTML is friendly for humans to understand, but is not so to machines. On ot...
Hierarchical Fuzzy Clustering Semantics (HFCS) in Web Document for Discovering Latent Semantics
This paper discusses the future of World Wide Web development, called the Semantic Web. Undoubtedly, the Web service is one of the most important services on the Internet, and it has had the greatest impact on the spread of the Internet in human societies. Internet penetration has been an effective factor in the growth of the volume of information on the Web. The massive growth of informat...
Anchor point indexing in Web document retrieval
Traditional World Wide Web search engines, such as AltaVista.com, index and recommend individual Web pages to assist users in locating relevant documents. As the Web grows, however, the number of matching pages increases at a tremendous rate. Users are often overwhelmed by the large answer set recommended by the search engines. Also, if a matching document is a hypertext, the document structure...
Journal:
- Journal of Documentation
Volume 54, issue
Pages -
Publication date: 1998